Search results for "distribution:Lingua-Interset Lingua Treebank"

Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::LA::It - Driver for the positional tagset of the Index Thomisticus Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Index Thomisticus Treebank in CoNLL format. The original tags are positional, with eleven positions. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC
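
The entry above notes that CoNLL tagsets in Interset are traditionally three values separated by tabs, taken from the CoNLL columns CPOS, POS and FEAT. As a rough illustration of that layout only, here is a minimal Python sketch. It does not use Lingua::Interset; the helper name `split_conll_tag` and the sample tag are hypothetical, and treating the third value as vertical-bar-separated items with `_` meaning "empty" is an assumption borrowed from the general CoNLL-X convention, not something the listing states.

```python
from typing import List, NamedTuple


class ConllTag(NamedTuple):
    """The three tab-separated parts of a CoNLL-style tag (hypothetical container)."""
    cpos: str
    pos: str
    feats: List[str]


def split_conll_tag(tag: str) -> ConllTag:
    """Split a 'CPOS<TAB>POS<TAB>FEAT' string into its three parts.

    Assumption: FEAT is a '|'-separated list, and '_' stands for no features.
    """
    parts = tag.split("\t")
    if len(parts) != 3:
        raise ValueError(f"expected 3 tab-separated values, got {len(parts)}")
    cpos, pos, feat = parts
    feats = [] if feat == "_" else feat.split("|")
    return ConllTag(cpos, pos, feats)


# Hypothetical sample tag, not taken from any of the listed treebanks.
example = split_conll_tag("N\tNN\tnum=sg|gen=m")
```

An actual Interset driver goes further and maps these surface values onto Interset features; the sketch only shows the three-part surface shape.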

Lingua::Interset::Tagset::UG::Udt - Driver for the tagset of the Uyghur Dependency Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Uyghur Dependency Treebank....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::CS::Pdt - Driver for the tagset of the Prague Dependency Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Prague Dependency Treebank....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::RO::Rdt - Driver for the tagset of the Romanian Dependency Treebank (RDT). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Romanian Dependency Treebank (RDT). The original RDT annotation is *not consistent:* four of the twenty POS tags and one dependency type appear only in the first 6% of the material, significantly reducing the POS...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::CS::Pdtc - Driver for the tagset of the Prague Dependency Treebank Consolidated. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Prague Dependency Treebank Consolidated (PDT-C)....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::EN::Penn - Driver for the tagset of the Penn Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Penn Treebank....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::FI::Turku - Driver for the Finnish tagset from the Turku Dependency Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Finnish tagset from the Turku Dependency Treebank. A tag is a sequence of features separated by vertical bars. Only the feature values appear, not attribute-value pairs....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Atom - Atomic driver for a surface feature. River stage one • 1 direct dependent • 5 total dependents

Atom is a special case of a tagset driver. As the name suggests, the surface tags are considered atomic, i.e. indivisible. It provides an environment for easy mapping between surface strings and Interset features. While Atom can be used to implement dri...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::CS::Ridics - Driver for the Prague-derived tagset used by the Research Infrastructure for Diachronic Czech Studies (RIDICS). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Prague-derived part-of-speech tagset used by the Research Infrastructure for Diachronic Czech Studies (RIDICS, Výzkumná infrastruktura pro diachronní bohemistiku, https://vokabular.ujc.cas.cz/). It is a positional tagset simil...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::RU::Syntagrus - Driver for Syntagrus (Russian Dependency Treebank) tags. River stage one • 1 direct dependent • 5 total dependents

Interset driver for Syntagrus (Russian Dependency Treebank) tags....

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::EU::Conll - Driver for the tagset of the Basque Dependency Treebank in the CoNLL format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Basque Dependency Treebank version 2011 in the CoNLL format. Note that this version of the tagset is slightly different from the Basque data of the CoNLL 2007 Shared Task. For instance, the features now contain f...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::FA::Conll - Driver for the tagset of the Persian Dependency Treebank (in the CoNLL-X format). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Persian Dependency Treebank (in the CoNLL-X format). CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. Tagset documentation is ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::HU::Conll - Driver for the Hungarian tagset of the CoNLL 2007 Shared Task (derived from the Szeged Treebank). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Hungarian tagset of the CoNLL 2007 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Hungarian, these values are derived fro...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::SV::Conll - Driver for the tagset of the Swedish treebank from the CoNLL 2006 Shared Task (Talbanken / Mamba). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Swedish treebank (Talbanken) from the CoNLL 2006 Shared Task. It was derived from the two-letter tags of the Mamba tagset. The sv::conll driver only handles a slight change in formatting. CoNLL tagsets in Interse...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::LA::Conll - Driver for the tagset of the Latin Dependency Treebank in CoNLL format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Latin Dependency Treebank in CoNLL format. The original tags are positional, with nine positions. This driver covers a format that we used in HamleDT processing where the input was first converted to CoNLL. ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::UR::Conll - Driver for the tagset of the Hyderabad Urdu Treebank, as used in the CoNLL data format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Urdu treebank from Hyderabad, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. In the case of Urdu, t...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::GRC::Conll - Driver for the tagset of the Ancient Greek Dependency Treebank in CoNLL format. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Ancient Greek Dependency Treebank in CoNLL format. The original tags are positional, with nine positions. This driver covers a format that we used in HamleDT processing where the input was first converted to...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::IT::Conll - Driver for the Italian tagset of the CoNLL 2007 Shared Task (derived from the ISST, Italian Syntactic-Semantic Treebank). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Italian tagset of the CoNLL 2007 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Italian, these values are derived from th...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC

Lingua::Interset::Tagset::JA::Conll - Driver for the Japanese tagset of the CoNLL 2006 Shared Task (derived from the TüBa J/S Verbmobil treebank). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Japanese tagset of the CoNLL 2006 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Japanese, these values are derived from ...

ZEMAN/Lingua-Interset-3.015 - 05 Mar 2022 11:01:56 UTC